Metaqueries for Data Mining
Abstract
This chapter presents a framework that uses metaqueries to integrate inductive learning methods with deductive database technologies in the context of knowledge discovery from databases. Metaqueries are second-order predicates or templates, and are used for (1) guiding deductive data collection, (2) focusing attention for inductive learning, and (3) assisting human analysts in the discovery loop. We describe in detail a system that uses this idea to unify a Bayesian Data Cluster with the Logical Data Language (LDL++), and show the results of three case studies, namely, discovering regularities from a knowledge base, discovering patterns and errors from a large telecommunication database, and discovering patterns and errors from a large chemical database. The patterns discovered using metaqueries are implication rules with probabilities. These rules can link information from many tables in databases, and they can be stored persistently for multiple purposes, including error detection, integrity constraints, or generation of more complex metaqueries.

15.1 Introduction

Recent progress in knowledge discovery from databases (Cercone and Tsuchiya 1993; Piatetsky-Shapiro 1993) has shown that inductive hypothesis generation, deductive hypothesis verification, and human intuition are all crucial components of an effective discovery system. Inductive learning is essential for generating hypotheses from data automatically, deductive database technology is a natural tool for gathering evidence in support of existing hypotheses, while human intuition (which may be inspired by the results of machine discovery) is necessary for generating and selecting the most promising hypotheses in a practical manner. However, the integration of these three components ...
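To make the idea of a metaquery as a second-order template more concrete, the following minimal Python sketch instantiates the template P(X,Y) and Q(Y,Z) implies R(X,Z) over a toy database and attaches a probability to each resulting rule. Everything in it is an assumption made for illustration: the relation names, the data, the `instantiate` function, and the `min_confidence` threshold are hypothetical, and the probability is a plain relative frequency rather than the Bayesian Data Cluster estimate used by the system described in the abstract.

```python
from itertools import permutations

# Hypothetical toy database: relation name -> set of (a, b) tuples.
DB = {
    "citizen": {("ann", "france"), ("bob", "japan"), ("eve", "france")},
    "official_language": {("france", "french"), ("japan", "japanese")},
    "speaks": {("ann", "french"), ("bob", "japanese"), ("eve", "english")},
}


def instantiate(db, min_confidence=0.5):
    """Instantiate the metaquery P(X,Y) & Q(Y,Z) -> R(X,Z).

    Each predicate variable P, Q, R is bound to a distinct binary relation
    in db. Returns a list of (p, q, r, confidence) tuples.
    """
    rules = []
    for p, q, r in permutations(db, 3):
        # Deductive step: join P and Q on the shared variable Y.
        body = {(x, z) for (x, y) in db[p] for (y2, z) in db[q] if y == y2}
        if not body:
            continue  # the rule body has no supporting data
        # Inductive step (simplified): fraction of body tuples that also
        # satisfy the head relation R.
        confidence = len(body & db[r]) / len(body)
        if confidence >= min_confidence:
            rules.append((p, q, r, confidence))
    return rules


if __name__ == "__main__":
    for p, q, r, conf in instantiate(DB):
        print(f"{p}(X,Y) & {q}(Y,Z) -> {r}(X,Z)  [p = {conf:.2f}]")
```

On this toy data the only instantiation with a non-empty body links three tables into one implication rule (citizens of a country tend to speak its official language). The single counterexample tuple lowers the rule's probability, which hints at how such rules might also flag candidate data errors.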
Similar resources
Metaqueries: Semantics, complexity, and efficient algorithms
Metaquery (metapattern) is a data mining tool which is useful for learning rules involving more than one relation in the database. The notion of a metaquery has been proposed as a template or a second-order proposition in a language L that describes the type of pattern to be discovered. This tool has already been successfully applied to several real-world applications. In this paper we advance ...
Metapattern Generation for Integrated Data Mining
Metapatterns (also known as metaqueries) have been proposed as a new approach to integrated data mining, and applied to several real-world applications successfully. However, designing the right metapatterns for a given application still remains a difficult task. In this paper, we present a metapattern generator that can automatically generate metapatterns from new databases. By integrating this...
Answering Metaqueries over Hi (OWL 2 QL) Ontologies
Hi(OWL2QL) is a new ontology language with the OWL2QL syntax and a specific semantics designed to support metamodeling and metaquerying. In this paper we investigate the problem of answering metaqueries in Hi(OWL2QL), which are unions of conjunctive queries with both ABox and TBox atoms. We first focus on a specific class of ontologies, called TBox-complete, where there is no uncertainty about ...
Using Metaqueries to Integrate Inductive Learning and Deductive Database Technology
This paper presents an approach that uses metaqueries to integrate inductive learning with deductive database technology in the context of knowledge discovery from databases. Metaqueries are second-order predicates or templates, and are used for (1) guiding deductive data collection, (2) focusing attention for inductive learning, and (3) assisting human analysts in the discovery loop. We describe ...
Computational Properties of Metaquerying Problems (arXiv:cs.DB/0106012v1, 7 Jun 2001)
Metaquerying is a data mining technology by which hidden dependencies among several database relations can be discovered. This tool has already been successfully applied to several real-world applications. Recent papers provide only preliminary results about the complexity of metaquerying. In this paper we define several variants of metaquerying that encompass, as far as we know, all variants de...
Publication date: 1996